A better block partition and ligation strategy for individual haplotyping

نویسندگان

  • YuZhong Zhao
  • Yun Xu
  • ZhiHao Wang
  • Hong Zhang
  • Guoliang Chen
چکیده

MOTIVATION Haplotype played an important role in the association studies of disease gene and drug responsivity over the past years, but the low throughput of expensive biological experiments largely limited its application. Alternatively, some efficient statistical methods were developed to deduce haplotypes from genotypes directly. Because these algorithms usually needed to estimate the frequencies of numerous possible haplotypes, the partition and ligation strategy was widely adopted to reduce the time complexity. The haplotypes were usually partitioned uniformly in the past, but recent studies showed that the haplotypes had their own block structure, which may be not uniform. More reasonable block partition and ligation strategy according to the haplotype structure may further improve the accuracy of individual haplotyping. RESULTS In this article, we presented a simple algorithm for block partition and ligation, which provided better accuracy for individual haplotyping. The block partition and ligation could be completed within O(m(2) logm+m(2n)) time complexity, where m represented the length of genotypes and n represented the number of individuals. We tested the performance of our algorithm on both real and simulated dataset. The result showed that our algorithm yielded better accuracy with short running time. AVAILABILITY The software is publicly available at http://mail.ustc.edu.cn/~zyzh.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

On the Complexity of SNP Block Partitioning Under the Perfect Phylogeny Model

Recent technologies for typing single nucleotide polymorphisms (SNPs) across a population are producing genome-wide genotype data for tens of thousands of SNP sites. The emergence of such large data sets underscores the importance of algorithms for large-scale haplotyping. Common haplotyping approaches first partition the SNPs into blocks of high linkage-disequilibrium, and then infer haplotype...

متن کامل

H-PoP and H-PoPG: heuristic partitioning algorithms for single individual haplotyping of polyploids

MOTIVATION Some economically important plants including wheat and cotton have more than two copies of each chromosome. With the decreasing cost and increasing read length of next-generation sequencing technologies, reconstructing the multiple haplotypes of a polyploid genome from its sequence reads becomes practical. However, the computational challenge in polyploid haplotyping is much greater ...

متن کامل

Rapid, long-range molecular haplotyping of thiopurine S-methyltransferase (TPMT) *3A, *3B, and *3C.

BACKGROUND Haplotyping is an important technique in molecular diagnostics because haplotypes are often more predictive for individual phenotypes than are the underlying single-nucleotide polymorphisms (SNPs). Until recently, methods for haplotyping SNPs separated by kilobase distances were laborious and not applicable to high-throughput screening. In the case of thiopurine S-methyltransferase (...

متن کامل

O-36: Genome Haplotyping and Detection of Meiotic Homologous Recombination Sites in Single Cells, A Generic Method for Preimplantation Genetic Diagnosis

Background: Haplotyping is invaluable not only to identify genetic variants underlying a disease or trait, but also to study evolution and population history as well as meiotic and mitotic recombination processes. Current genome-wide haplotyping methods rely on genomic DNA that is extracted from a large number of cells. Thus far random allele drop out and preferential amplification artifacts of...

متن کامل

Long-range, high-throughput haplotype determination via haplotype-fusion PCR and ligation haplotyping

Ligation Haplotyping is a robust, novel method for experimental determination of haplotypes over long distances, which can be applied to assaying both sequence and structural variation. The simplicity and efficacy of the method for genotyping large chromosomal rearrangements and haplotyping SNPs over long distances make it a valuable and powerful addition to the methodological repertoire, which...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Bioinformatics

دوره 24 23  شماره 

صفحات  -

تاریخ انتشار 2008